An Efficient Global Discretization Method Technical Report Number 296
نویسندگان
چکیده
The development of an effective and efficient method for discretization of continuous variables is an important problem to be solved in developing generally applicable methods for data mining. In Ho and Scott 1997, we describe a new technique for discretization of continuous variables based on zeta, a measure of strength of association between nominal variables. The old zeta method partitions a continuous variable into as many sub-ranges as the values of the classification variable. In this paper, we introduce a generalised form of zeta that remove this restriction. The new method is also more efficient particularly when there are more than 4 distinct values in the classification variable. A series of experimental evaluations were performed comparing the improved zeta, together with the old zeta and C4.5 which employs its own local discretization method. Surprisingly, there were little difference in accuarcies produced by all the 3 methods, however, the improved zeta discretization technique runs considerably faster in all cases. We conclude that zeta global discretization offers advantages over C4.5.
منابع مشابه
Efficient Analysis of Plasmonic circuits using Differential Global Surface Impedance (DGSI) Model
Differential global surface impedance (DGSI) model, a rigorous approach, has been applied to the analysis of three dimensional plasmonic circuits. This model gives a global relation between the tangential electric field and the equivalent surface electric current on the boundary of an object. This approach helps one bring the unknowns to the boundary surface of an object and so avoid volumetric...
متن کاملTechnical Report No: BU-CE-1001 A Discretization Method based on Maximizing the Area Under ROC Curve
We present a new discretization method based on Area under ROC Curve (AUC) measure. Maximum Area under ROC Curve Based Discretization (MAD) is a global, static and supervised discretization method. It discretizes a continuous feature in a way that the AUC based only on that feature is to be maximized. The proposed method is compared with alternative discretization methods such as Entropy-MDLP (...
متن کاملLocal and Global Approaches to Fracture Mechanics Using Isogeometric Analysis Method
The present research investigates the implementations of different computational geometry technologies in isogeometric analysis framework for computational fracture mechanics. NURBS and T-splines are two different computational geometry technologies which are studied in this work. Among the features of B-spline basis functions, the possibility of enhancing a B-spline basis with discontinuities ...
متن کاملA Local-in-Space-Timestep Approach to a Finite Element Discretization of the Heat Equation with a Posteriori Estimates
A new numerical method is presented for the heat equation with discontinuous coefficients based on a Crank–Nicolson scheme and a conforming finite element space discretization. In the proposed method each node of the spatial discretization may have the global timestep split into an arbitrary number of local substeps in order to pursue a local improvement of the time discretization in the region...
متن کاملComputer Science Technical Report TR - 07 - 12 ( 955 ) March 22 , 2007
This paper constructs multirate time discretizations for hyperbolic conservation laws that allow different timesteps to be used in different parts of the spatial domain. The proposed family of discretizations is second order accurate in time and has conservation and linear and nonlinear stability properties under local CFL conditions. Multirate timestepping avoids the necessity to take small gl...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998